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Abstract. An atom of a regular language L with n (left) quotients is a 
non-empty intersection of uncomplemented or complemented quotients 
of L, where each of the n quotients appears in a term of the intersection. 
The quotient complexity of L, which is the same as the state complexity 
of L, is the number of quotients of L. We prove that, for any language 
L with quotient complexity n, the quotient complexity of any atom of L 
with r complemented quotients has an upper bound of 2™ — 1 if r = 
or r = n, and 1 + Yuk=i ^2h=k+i Cft ' Cfc otherwise, where C] is the 
binomial coefficient. For each n ^ 1, we exhibit a language whose atoms 
meet these bounds. 



1 Introduction 

Atoms of regular languages were introduced in 2011 by Brzozowski and Tamm [3]; 
we briefly state their main properties here. 

The (left) quotient of a regular language L over an alphabet S by a word 
w G S* is the language w~ x L = {x G S* \ wx G L}. It is well known that a 
language L is regular if and only if it has a finite number of distinct quotients, 
and that the number of states in the minimal deterministic finite automaton 
(DFA) recognizing L is precisely the number of distinct quotients of L. Also, L 
is its own quotient by the empty word e, that is e~ 1 L = L. Note too that the 
quotient by u G S* of the quotient by w G S* of L is the quotient by wu of L, 
that is, u~ 1 (w^ 1 L) = (wu)^ 1 L. 

An atom 3 of a regular language L with quotients Kq, ■ ■ ■ , K n -i is any non- 
empty language of the form Kq H • • • D K n _i, where Ki is either Ki or Ki, and 
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quotients to be an atom. Our new definition adds symmetry to the theory. 



Ki is the complement of Ki with respect to E* . Thus atoms of L are regular 
languages uniquely determined by L and they define a partition of £*. They are 
pairwise disjoint, every quotient of L (including L itself) is a union of atoms, 
and every quotient of an atom is a union of atoms. Thus the atoms of a regular 
language are its basic building blocks. Also, L defines the same atoms as L. 

The quotient complexity [2] of L is the number of quotients of L, and this is 
the same number as the number of states in the minimal DFA recognizing L; 
the latter number is known as the state complexity [8] of L. Quotient complexity 
allows us to use language-theoretic methods, whereas state complexity is more 
amenable to automaton-theoretic techniques. We use one of these two points of 
view or the other, depending on convenience. 

We study the quotient complexity of atoms of regular languages. Suppose 
that L C S* is a non-empty regular language and its set of quotients is K = 
{Ko, Ki,..., K n -i}, with n > 1. Our main result is the following: 

Theorem 1 (Main Result). 

For n > 1, the quotient complexity of the atoms with or n complemented 
quotients is less than or equal to 2™ — 1. For n ^ 2 and r satisfying 1 ^ r ^ n — 1, 
the quotient complexity of any atom of L with r complemented quotients is less 
than or equal to 

r k+n—r 

f(n,r) = l + J2 E C h' C l 

k=l h=k+l 

For n = 1, the single atom S* of the language S* or meets the bound 1. 
Moreover, for n > 2, all the atoms of the language L n recognized by the DFA V n 
of Figure 1 meet these bounds. 




Fig. 1. DFA T) n of language L n whose atoms meet the bounds. 

In Section 2 we derive upper bounds on the quotient complexities of atoms. In 
Section 3 we define our notation and terminology for automata, and present the 
definition of the atomaton [3] of a regular language; this is a nondctcrministic 
finite automaton (NFA) whose states arc the atoms of the language. We also 
provide a different characterization of the atomaton. We introduce a class of 
DFA's in Section 4 and study the atomata of their languages. We then prove 
in Section 5 that the atoms of these languages meet the quotient complexity 
bounds. Section 6 concludes the paper. 
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2 Upper Bounds on the Quotient Complexities of Atoms 



We first derive upper bounds on the quotient complexity of atoms. We use quo- 
tients here, since they are convenient for this task. First we deal with the two 
atoms that have only uncomplemented or only complemented quotients. 

Proposition 1 (Atoms with or n Complemented Quotients). 

For n > 1, the quotient complexity of the two atoms Ak = Kq fl • • ■ fl if„_i and 
A$ = Kq fl • ■ • fl is less than or equal to 2™ — 1. 

Proof. Every quotient w~ 1 Ak of atom Ak is the intersection of languages 
w~ 1 K i , which are quotients of L: 

w^A K = w^iKa n • • • n K n _i) = w^Ko n • • • n io _1 ^ n _i. 

Since these quotients of L need not be distinct, w~ 1 Ak may be the intersection 
of any non-empty subset of quotients of L. Hence Ak can have at most 2™ — 1 
quotients. 

The argument for the atom A$ = Kq fl • • • fl K n _\ with n complemented 
quotients is similar, since w~ 1 K i = w~ 1 K i . □ 

Next, we present an upper bound on the quotient complexity of any atom 
with at least one and fewer than n complemented quotients. 

Proposition 2 (Atoms with r Complemented Quotients, 1 ^ r ^ n— 1). 

For n ^ 2 and 1 ^ r ^ n — 1, the quotient complexity of any atom with r 
complemented quotients is less than or equal to 

r k+n — r 

/(n,r) = l + £ E C h' C l (!) 

k=l h=k+l 

where C* is the binomial coefficient "i choose j ". 

Proof. Consider an intersection of complemented and uncomplemented quotients 
that constitutes an atom. Without loss of generality, we arrange the terms in the 
intersection in such a way that all complemented quotients appear on the right. 
Thus let Ai = Ko fl ■ • ■ fl if„_ r _i n K n - r D • • • fl K n -\ be an atom of L with r 
complemented quotients of L, where 1 ^ r ^ n — 1. The quotient of Ai by any 
word w G S* is 

w^Ai =w- 1 (K r\---n Kn-r-x n K n - r n • • • n K~i) 

= w~ 1 k n • • • n w~ 1 K n - r -i n w- 1 K n - r n • • • n w^Kn-i. 

Since each quotient w~ 1 Kj is a quotient, say K^, of L, we have 

w^Ai = K io n • • • n K in _ r _, nK~n---nK~- 

The cardinality of a set S is denoted by \S\. Let the set of distinct quotients 
of L appearing in w~ 1 A i uncomplemented (respectively, complemented) be X 
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(respectively, Y), where 1 \X\ sC n - r and 1 < \Y\ r. If X n F ^ 0, 
then = 0. Therefore assume that X n Y = 0, and that \X\JY\ = h, 

where 2 ^ h < n; there are C£ such sets luy. Suppose further that F = k, 
where 1 < k ^ r. There are ways of choosing Y\ Hence there are at most 
Shife+i ^h'^k distinct intersections with k complemented quotients. Thus, the 
total number of intersections of uncomplemented and complemented quotients 
can be at most £Li EtS+T ^ ■ C* 

Adding 1 for the empty quotient of w 1 Ai, we get the required bound. □ 

We now consider the properties of the function f(n, r). 

Proposition 3 (Properties of Bounds). For any n ^ 2 and 1 ^ r < n — 1, 

1. f(n,r) = f(n,n-r). 

2. For a fixed n, the maximal value of f(n,r) occurs when r = [n/2\. 

Proof. Since f(n,r) = 1 + J2k=i Z)/Sk+i ' ^fc> anc ^ tne following equations 
hold: 

r k-\-n—r r n — r n — r r 

E\ ^ n n rth _ \ ~* \ ~* s-m r< k + l — \ ^ \ ^ r< n r< k + l 
fc=ift=fe+i fe=i ;=i i=i fc=i 

n—r r n—r l+r 

;=i fe=l (=1 m=l+l 

we have /(n, r) = f(n, n — r). 

For the second part, we will assume that 1 ^ r < [n/2\, and show that 
f(n, r + 1) > /(n, r) for this case. We find f(n, r + 1) — f(n, r) as follows: 

r+1 fc+ro— i — 1 r k+n — r 

/(n,r + l)-/(n,r) = l + £ £ £ C£ • <#) 

fc=l fc=fc+l fc=l fc=fc+l 

r+1 fc+n— r— 1 r fc+n— r 

- £ £ c?-c*-£ £ c£.c£ 

fc=r+l /i— fc+1 fe=l /i— /c+n — r 

n r 

E/^tn y^f/? \ A s~iTi s~ik-\-n—r 

/i=r+2 fc=l 

Since the first summation can be written as 

n n—r n 

£ c n h -c h r+1 = £ c^-c r fc +1 + £ c n h -c h r+1 

h=r+2 h=r+2 h=n-r+l 



E/~tn /~th I \ sin s-ik+n- 
' °r+l "T 2^ °fc+n-r ' °r+l 



h=r+2 k=l 
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we get 

f(n, r + 1)- f(n, r) = £ C£ ■ C? +1 + £ C[l +n _ r ■ 

h=r+2 k=l 
r 

^k+n-r ■ U fc 

k=l 

Assuming 1 < k ^ r, we will show that C*+[ l - r > C k k +n - r . We can express 
the ratio C k+n-r jfjk+n-r as follows: 

C*+T~ r (fc + n-r)! , (fc + n-r)! 

C*+ n ~ r ~ (r + l)\(k + n - 2r - 1)! ' fc!(n - r)! 
kl(n — r)\ 
= (r + l)\(k + n - 2r - 1)\ 
_ k\(n - r) ■ • ■ (n - 2r + k)(n - 2r + k - 1)! 
(r + 1) • • • (k + Tjk\(n - 2r + k - 1)! 
(n — r) ■ ■ ■ (n — 2r + k) 
= (r + 1) •••(£ + !) ' 

Note that there are r — k + 1 factors both in the numerator and the denominator 
of the obtained fraction. Therefore, we can write 

Cr+r _r n — r n — r — 1 n — 2r + k 

Cl+ n - r ~ r + 1 ~ ' k + 1 

The condition 1 ^ r < [n/2\ implies that n > 2r + 1; consequently we have 

n — r > r + 1, n — r — 1 > r, n — 2r + fc > + 1. 

Therefore C k +i~ r /C k k +n - r > 1, which implies that C k +?~ r > C k k +n - r . 
It follows that 

r r 

Es-in s-ik+n — r \ - ^k+n — r 

°fc+n-r ' W+l ^fc+ri-r ' u fc i 

fe=l fe=l 

and /(n, r + 1) - f(n, r) > 0. So, if 1 r < |n/2j , then /(n, r + 1) > /(n, r). 
Since /(n, r) = /(n, n — r), the maximum of f(n, r) occurs when r = \n/2\ . □ 

To better illustrate the properties of /(n,r), we derive explicit formulas for 
the first three values of r. Using the well-known identity 

n 

]T Cl ■ C h k = 2"- fc C fc ", (2) 

h=k 
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wc find 



/(n,l) = n2 n ~ x -n + 1, 

f(n, 2) = n2 n ~ 1 -2n + lill^ii ( 2 ™- 2 - 1) + 1, 

f(n, 3) = n2 n ~ 1 - (n 2 + n) + "("~ 1 )("+ 4 ) (2"~ 3 - 1) + 1. 

Some numerical values of /(n, r) are shown in Table 1. The figures in boldface 
type are the maxima for a fixed n. The row marked max shows the maximal 
quotient complexity of the atoms of L. The row marked ratio shows the value of 
f(n, [n/2\)/f(n — 1, [(n — 1)/2J ) , for n > 2. It appears that this ratio converges 
to 3. For example, for n = 100 it is approximately 3.0002. 



Table 1. Maximal quotient complexity of atoms. 



n 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 




r=0 


1 


3 


7 


15 


31 


63 


127 


255 


511 


1,023 




r=l 


1 


3 


10 


29 


76 


187 


442 


1,017 


2,296 


5,111 




r=2 


* 


3 


10 


43 


141 


406 


1,086 


2,773 


6,859 


16, 576 




r=3 


* 


* 


7 


29 


141 


501 


1,548 


4,425 


12, 043 


31,681 




r=4 


* 


* 


* 


15 


76 


406 


1,548 


5,083 


15,361 


44,071 




r=5 


* 


* 


* 


* 


31 


187 


1,086 


4,425 


15,361 


48, 733 




max 


1 


3 


10 


43 


141 


501 


1,548 


5,083 


15,361 


48, 733 




ratio 




3 


3.33 


4.30 


3.28 


3.55 


3.09 


3.28 


3.02 


3.17 





3 Automata and Atomata of Regular Languages 

If E is a non-empty finite alphabet, then E* is the free monoid generated by E. 
A word is any element of E* , and the empty word is e. A language over E is 
any subset of E*. The reverse of a language L is denoted by L R and defined as 
L R = {w R | w € L}, where w R is w spelled backwards. 

A nondeterministic finite automaton (NFA) is a quintuple M — (Q, E, i], I, F), 
where Q is a finite, non-empty set of states, S is a finite non-empty alphabet, 
n : Q x S — > 2® is the transition function, I C Q is the set of initial states, and 
F C Q is the set of final states. As usual, we extend the transition function to 
functions rf : Q x E* ->■ 2 Q , and 77" : 2 Q x E* ->■ 2 Q . Wc do not distinguish 
these functions notationally, but use r\ for all three. 

The language accepted by an NFA M is L(M) = {w e E* r,(I, w) D F ^ 0}. 
Two NFA's are equivalent if they accept the same language. The right language 
of a state q of Af is L q ^ F (Af) — {w e E* \ r](q, w) n F ^ 0}. The right language 
of a set S of states of A/" is L s ,f(AT) = \J qe s L q,F(Af); hence L(Af) = L IfF (AT). 
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A state is empty if its right language is empty. Two states of an NFA are equiv- 
alent if their right languages are equal. The left language of a state q of Af is 
Li, q = {w 6 S* | q E r/(I,w)}. A state is unreachable if its left language is 
empty. An NFA is trim if it has no empty or unreachable states. 

A deterministic finite automaton (DFA) is a quintuple T> — (Q, S, 5, qo, F), 
where Q, S, and F are as in an NFA, S : Q x E — > Q is the transition function, 
and qo is the initial state. A DFA is an NFA in which the set of initial states 
is {go} and the range of S is restricted to singletons {q}, q 6 Q. Note that an 
empty state of Af is an unreachable state of Af R and vice versa. 

We use the following operations on automata: 

1. The determinization operation U> applied to an NFA Af yields a DFA Af n 
obtained by the well-known subset construction, where only subsets reach- 
able from the initial subset of Af° are used and the empty subset, if present, 
is included. 

2. The reversal operation R applied to an NFA Af yields an NFA A/" R , where 
sets of initial and final states of Af are interchanged and each transition 
between any two states is reversed. 

From now on we consider only non-empty regular languages. Let L be any 
such language, and let its set of quotients be K — {K , . ■ . , K n _i}. One of the 
quotients of L is L itself; this is called the initial quotient and is denoted by K in . 
A quotient is final if it contains the empty word e. The set of final quotients is 
F = {Ki | s e Ki}. 

In the following definition we use a one-to-one correspondence Ki <-> Kj 
between quotients Ki of a language L and the states Kj of the quotient DFA T> 
defined below. We refer to the Kj as quotient symbols. 

Definition 1. The quotient DFA of L is V = (K, S, 5, Kj„, F), where K = 
{Ko, . . . , K„_i}, Kj„ corresponds to Ki n , F = {K^ | Ki 6 F}, and S(Ki, a) = 
Kj if and only if a~ 1 K i = Kj, for all Kj, Kj e K and a € S. 

In a quotient DFA the right language of is Ki, and its left language 
is {w e S* | w^ 1 L = Ki}. The latter is the equivalence class of the Nerode 
equivalence [5]. The language L(T>) is the right language of Kj„, and hence 
L(T>) = L. Also, DFA V is minimal, since all quotients in K are distinct. 

It follows from the definition of an atom, that a regular language L has at 
most 2" atoms. An atom is initial if it has L (rather than L) as a term; it is final 
if it contains e. Since L is non-empty, it has at least one quotient containing e. 
Hence it has exactly one final atom, the atom K n • • • fl K n _\, where Ki = Ki 
if s G Ki, and Ki = Ki otherwise. Let A = {Ao, . . . , A m ^i} be the set of atoms 
of L. By convention, / is the set of initial atoms and A m -\ is the final atom. 

As above, we use a one-to-one correspondence A; o Aj between atoms Ai 
of a language L and the states Aj of the NFA A defined below. We refer to the 
Aj as atom symbols. 
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Definition 2. The atomaton 4 of L is the NFA A — (A, S, r\, I, {A TO _i}), where 
A = {Ai | Ai G A}, I = {Ai | e /}, A m _i corresponds to A m _\, and 
Aj £ r](Ai,a) if and only if aAj C Aj, for all Aj, Aj e A and a £ E. 

Example 1. Let L2 C {a,c}* be defined by the quotient equations below (left) 
and recognized by the DFA V 2 of Fig. 2 (a). The equations for the atoms of 
L2 are below (right), and the atomaton A2 is in Fig. 2 (b); here each atom is 
denoted by Ap, where P is the set of uncomplemented quotients. Thus K fl K\ 
becomes ^4{o}> etc-, and we represent the sets in the subscripts without brackets 
and commas. The reverse 2?f of £>2 is in Fig. 2 (c). The determinized reverse X>™ 
is in Fig. 2 (d); this is the minimal DFA for Lf", the reverse of The reverse 
A\ of the atomaton is in Fig. 2 (e). Note that D™ and -4? are isomorphic. 

if = a-ft'i U cK , K {) C\K l = a(K D K x ) U c[(K n #1) U (if n i^)], 

ifi = aK U cif U e, K n ST = a(i?o n ifi), 

i?o n if 1 = a(if n Ki) U e, 

I^n^ = (i(Ioni^)uc[(^n^) u (Kbnxi)]. 



c c a, c a, c 




Fig. 2. (a) DFA D 2 ; (b) Atomaton _4 2 ; (c) NFA T$; (d) DFA Df 5 ; (e) DFA A\. 

The next theorem from [1], also discussed in [3], will be used several times. 

Theorem 2 (Determinization). If an NFA Af has no empty states and A/" R 
is deterministic, then N n is minimal. 

4 In [3], the intersection A® = Ko fl • • • fl K n -\ was not considered an atom. It was 
shown that the right language of state Ai is the atom Ai, the left language of Ai is 
non-empty, the language of the atomaton A is L, and A is trim. If the intersection 
A^ of all the complemented quotients is non-empty, then A® is an atom and A is no 
longer trim because state A0 is not reachable from any initial state. 
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It was shown in [3] that the atomaton A of L with reachable atoms only 
is isomorphic to the trimmed version of T> RBR , where T> is the quotient DFA 
of L. With our new definition, A is isomorphic to £> RDR . Wc now study this 
isomorphism in detail, along with the isomorphism between _4 R and X) RD . We 
deal with the following automata: 

1. Quotient DFA V — (K, S, S, K in , F) of L whose states are quotient symbols. 

2. The reverse V R = (K, E, S R , F, {K m }) of V. The states in K are still quotient 
symbols, but their right languages are no longer quotients of L. 

3. The determinized reverse D RD = (S, S, a, F, G), where S C 2 K and G = 
{Si G S | Kj„ G Si}. The states in S are sets of quotient symbols, i.e., 
subsets of K. Since (£> R ) R — T> is deterministic and all of its states are 
reachable, T> R has no empty states. By Theorem 2, DFA 2? RD is minimal and 
accepts L R ; hence it is isomorphic to the quotient DFA of L R . 

4. The reverse V m = (S, S, a R , G, {F}) of £> RD ; here the states are still sets 
of quotient symbols. 

5. The atomaton A = (A, S, rj, I, {A m _i}), whose states are atom symbols. 

6. The reverse _4 R = (A, S, i] R , A m _i, I) of A, whose states are still atom 
symbols, though their right languages are no longer atoms. 

The results from [3] and our new definition of atoms imply that A R is a minimal 
DFA that accepts L R . It follows that A R is isomorphic to V Rn . Our next result 
makes this isomorphism precise. 

Proposition 4 (Isomorphism). Let ip : A — > S be the mapping assigning to 
state Aj, given by Aj = K ia fl • • • l~l K in _ r _ ± n K ln _ r fl ■ • • ("1 K in _ 1 of A R , the set 
{K io , . . . ,K in _ r _ 1 }. Then ip is a DFA isomorphism between A R and 2? KD . 

Proof. The initial state A m _i of *4 R is mapped to the set of all quotients con- 
taining e, which is precisely the initial state F of T> m . Since the quotient L 
appears uncomplemented in every initial atom Ai £ /, the image (f(A-i) contains 
L. Thus the set of final states of „4 R is mapped to the set of final states of £> RD . 

It remains to be shown, for all A^, Aj e A and sgS, that rj R (Aj, a) = Ai 
if and only if a(ip(Aj), a) = (p(Ai). 

Consider atom Ai with Pi as the set of quotients that appear uncomple- 
mented in Ai. Also define the corresponding set Pj for Aj. If there is a missing 
quotient Kh in the intersection a~ 1 Ai, we use a~ x Ai n (Kh UKh). We do this for 
all missing quotients until we obtain a union of atoms. Hence Aj £ r/(Ai, a) can 
hold in A if and only if Pj D S(P l , a) and Pj n 8(Q \ Pi, a) = 0. It follows that in 
A R we have n R (Aj,a) — Ai if and only if Pj D S(Pi,a) and PjC\S(Q\Pi, a) = 0. 

Now consider 2? RD . Let Pj be any subset of Q; then the successor set of Pi in 
V is 5{Pi,a). Let S(P,,a) = P k . So in V R , we have P 4 G S R (P k ,a). But suppose 
that state q is not in S(Q,a); then S R (q,a) — 0. Consequently, we also have 
Pi G S R (P k U {q}, a). It follows that for any Pj containing S(Pi,a) and satisfying 
Pj fl S(Q \ Pi, a) = 0, we also have a(Pj,a) — Pi. 

We have now shown that n M (Aj, a) — Ai if and only if a(Pj,a) = Pi, for all 
subsets Pi,Pj G S, that is, if and only if a(ip(Aj), a) = p(Ai). □ 
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Corollary 1. The mapping ip is an NFA isomorphism between A and V 1 



In the remainder of the paper it is more convenient to use the T> repre- 
sentation of atomata, rather than that of Definition 2. 

4 The Witness Languages and Automata 

We now introduce a class {L n \ n > 2} of regular languages defined by the 
quotient DFA's D n given below; we shall prove that the atoms of each language 
L n = L(T> n ) in this class meet the worst-case quotient complexity bounds. 

Definition 3 (Witness). For n ^ 2, let T> n = (Q, E, 5, qo, F), where Q = 
{0, . . . , n — 1}, £ = {a, b, c}, q n = 0, F = {n — 1}, S(i, a) = i + 1 mod n, 
5(0, b) = 1, 5(1, b) = 0, 5(i, b) = i for i > 1, 5(i, c) = i for < i < n - 2, and 
8(n — 1, c) = 0. Let L n be the language accepted by V n . 

For n > 3, the DFA of Definition 3 is illustrated in Fig. 1, and V 2 is the DFA 
of Example 1 (a and b coincide). The DFA V n is minimal, since for ^ i ^ n— 1, 
state i accepts a"~ 1-4 , and no other state accepts this word. 

A transformation of a set Q is a mapping of Q into itself. If t is a trans- 
formation of Q and i £ Q, then is the image of i under t. The set of all 
transformations of a finite set Q is a semigroup under composition, in fact, a 
monoid 7q of n n elements. A permutation of Q is a mapping of Q onto itself. A 
transposition interchanges i and j and does not affect any other elements. 
A singular transformation, denoted by Q), has it — j and ht = h for all h ^ i. 

In 1935 Piccard [6] proved that three transformations of Q are sufficient to 
generate Tq. Denes [4] studied more general generators; we use his formulation: 

Theorem 3 (Transformations). The transformation monoid Tq can be gen- 
erated by any cyclic permutation of n elements together with any transposition 
and any singular transformation. 

In any DFA T> — (Q, S, 5, qo, F), each word w in E + performs a transforma- 
tion on Q defined by S(-,w). The set of all these transformations is the trans- 
formation semigroup of T>. By Theorem 3, the transformation semigroup of our 
witness T> n has n n elements, since a is a cyclic permutation, & is a transposition 
and c is a singular transformation. 

The following result of Salomaa, Wood and Yu [7] concerning reversal is 
restated in our terminology. 

Theorem 4 (Transformations and Reversal). Let V be a minimal DFA 
with n states accepting a language L. If the transformation semigroup of "D has 
n n elements, then the quotient complexity of L R is 2™. 

Corollary 2 (Reversal). For n ^ 2, the quotient complexity of L R is 2". 

Corollary 3 (Number of Atoms of L n ). The language L n has 2™ atoms. 
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Proof. By Corollary 1, the atomaton of L n is isomorphic to the reversed quotient 
DFA of L%. By Corollary 2, the quotient DFA of L% has 2" states, and so the 
empty set of states of L n is reachable in L^. Hence has the empty quotient, 
implying that the intersection of all the complemented quotients is non-empty, 
and so L n has 2™ atoms. □ 

Propositions (Transitions of the Atomaton). Let V n = (Q, E,S, qo, F) 
be the DFA of Definition 3. The atomaton of L n = LiV n ) is the NFA A n = 
(2Q,E,T},I,{n-l}), where 

1. IfS = {0}, then r](S,a) = {0}. Otherwise, 

r}({si, . . . , Sfe}, a) = {si + 1, . . . , Sfc + 1}, where the addition is modulo n. 

2. If {0, 1} n S = 0, then 

(a) rj(S,b) = S, 

(b) #}U5,i)={l}US, 

(c) V ({l}US,b) = {0}US, 

(d) #,1}US,^{0,1}US. 

3. If{0,n-l}nS = 0, then 

(a) n(S,c) = {S,{n-l}US}, 

(b) »y({0, n - 1} U S, c) = {{0, n-l}uS, {0} U S}, 

(c) #}US,c)=«, 

(d) >,({n-l}uS,c) = i. 

Proof. The reverse of DFA V n is the NFA = (Q, E, 6 R , {n - 1}, {0}), where 
S R is defined by 5 R (i, a) = i - 1 mod n, (5 R (i, 6) = 6), (5 R (0, c) = {0, n - 1}, 
5 m (n— 1, c) = 0, and <5 R (i, c) = i, for < i < n—1. After applying determinization 
and reversal to £> R , the claims follow by Corollary 1. □ 

5 Tightness of the Upper Bounds 

We now show that the upper bounds derived in Section 2 are tight by proving 
that the atoms of the languages L n of Definition 3 meet those bounds. 

Since the states of the atomaton A n — (A, S, 77, 1, {A m _i}) are atom symbols 
Aj, and the right language of each Aj is the atom Aj, the languages A t are 
properly represented by the atomaton. Since, however, the atomaton is an NFA, 
to find the quotient complexity of Aj, we need the equivalent minimal DFA. 

Let T> n be the n-state quotient DFA of Definition 3 for n ^ 2, and recall that 
L(T> n ) = L n . In the sequel, using Corollary 1, we represent the atomaton A n of 
L n by the isomorphic NFA 2?™ R = (S, E, a R , G, {F}), and identify the atoms by 
their sets of uncomplemented quotients. To simplify the notation, we represent 
atoms by the subscripts of the quotients, that is, by subsets of Q — {0, . . . , n— 1}, 
as in Definition 3. 

In this framework, to find the quotient complexity of an atom Ap, with 
P C Q, we start with the NFA A P = (S, E, a R , {P}, {F}), which has the same 
states, transitions, and final state as the atomaton, but has only one initial 
state, P, corresponding to the atom symbol Ap. Because .4 R is deterministic 
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and Ap has no empty states, A P is minimal by Theorem 2. Therefore, A P is 
the quotient DFA of the atom Ap. The states of A P are certain sets of sets of 
quotient symbols; to reduce confusion we refer to them as collections of sets. 
The particular collections appearing in A P will be called "super-algebras" . 

Let U be a subset of Q with \ U\ = u, and let V be a subset of U with \V\ = v. 
Define (V)u to be the collection of all 2 U ~ V subsets of U containing V. There are 
C"C" collections of the form (V)u, because there are C™ ways of choosing U, 
and for each such choice there are C" ways of choosing V. The collection (V)u is 
called the super-algebra of U generated by V. The type of a super-algebra (V)u 
is the ordered pair (|V|, \U\) = (v,u). 

The following theorem is a well-known result of Piccard [6] about the group — 
known as the symmetric group — of all permutations of a finite set: 

Theorem 5 (Permutations). The symmetric group of size n! of all permuta- 
tions of a set Q = {0, ... ,n — 1} is generated by any cyclic permutation of Q 
together with any transposition. 

Lemma 1 (Strong- Connectedness of Super- Algebras). Super- algebras of 
the same type are strongly connected by words in {a, b}* . 

Proof. Let (Vi)^ and (V^)^ be any two super-algebras of the same type. Ar- 
range the elements of V\ in increasing order, and do the same for the elements of 
the sets V2, U\\V\, Ui\ V2, Q \ U\, and Q \ £/ 2 . Let tt : Q — > Q be the mapping 
that assigns the ith element of V2 to the ith element of V\, the ith element of 
U2 \ V2 to the ith element of U\ \V\, and the ith element of Q \ U2 to the ith 
clement of Q \ U\. For any R\ such that V\ C Ri C Ui, there is a corresponding 
subset i?2 = tt(-Ri), where V2 C R 2 C J/2- Thus n establishes a one-to-one corre- 
spondence between the elements of the super-algebras (V\)u 1 and (V^)^. Also, 
7r is a permutation of Q, and so can be performed by a word w £ {a, b}* in £>„, 
in view of Theorem 5. Thus every set R2 defined as above is reachable from Ri 
by w. So (V2)u 2 1S reachable from (Vi)^- □ 

Lemma 2 (Reachability). Let (V)u be any super-algebra of type (v,u). If 
v ^ 2, then from (V)u we can reach a super-algebra of type (v—l,u). Ifu ^ n — 2, 
then from (V)u we can reach a super-algebra of type (v,u + 1). 

Proof. If v > 2, then by Lemma 1, from (V")t/ we can reach a super-algebra 
(V')u' of type (v, u) such that {0, n— 1} C V"'. By input c we reach (V'\{n—l})u< 
of type (i> — 1 , u) . For the second claim, if u ^ n — 2 , then by Lemma 1 , from (V) u 
we can reach a super-algebra (V')u' of type (v, u) such that {0, u-l}ny' = 0. 
By input c we reach (V')t/'u{n-i} °f type (w, w + 1). □ 

The next proposition holds for n ^ 1 if we let £1 = Z"*. 

Proposition 6 (Atoms with or n Complemented Quotients). 

For n > 1, t/ie quotient complexity of the atoms Aq and A® of L n is 2™ — 1. 
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Proof. Let Aq {Aid) be the modified atomaton with only one initial state, Q (0). 
By the considerations above, Aq (Aq) is the quotient DFA of Aq (Aq); hence 
it suffices to prove the reachability of 2™ — 1 collections. 

For Aq, the initial state of Aq is the collection {Q}, which is the super- 
algebra (Q)q of Q generated by Q. Now suppose that we have reached a super- 
algebra of type (v,n). By Lemma 1, we can reach every other super-algebra of 
type (v,n). If v ^ 2, then by Lemma 2 we can reach a super-algebra of type 
(v — l,n). Thus we can reach all super-algebras (V)q of Q, one for each non- 
empty subset V of Q. Since there are at most 2" — 1 collections and that many 
can be reached, no other collection can be reached. 

For Aq, the initial state of A® is the empty collection, which is the super- 
algebra (0)0 of generated by 0. Now suppose we have reached a super-algebra of 
type (0,u). By Lemma 1, we can reach every other super-algebra of type (0, u). 
If u ^ n — 2, then by Lemma 2 we can reach a super-algebra of type (0, u + 1). 
Thus we can reach all super-algebras (0}[/, one for each non-empty subset U 
of Q. Since there are at most 2™ — 1 collections and that many can be reached, 
no other collection can be reached. 

Hence the proposition holds. □ 

Proposition 7 (Tightness). For n ^ 2 and 1 ^ r ^ n — 1, the quotient 
complexity of any atom of L n with r complemented quotients is f(n,r). 

Proof. Let Ap be an atom of L n with n—r uncomplemented quotients, where 1 $C 
r < n — 1, that is, let P be the set of subscripts of the uncomplemented quotients. 
Let Ap be the modified atomaton with the initial state P. As discussed above, 
A P is minimal; hence it suffices to prove the reachability of f(n,r) collections. 

We start with the super-algebra (P) p with type (n — r, n — r). By Lemmas 1 
and 2, we can now reach all super-algebras of types 

(n — r, n — r), (n — r — 1, n — r), . . . , (1, n — r), 

(n — r, n — r + 1), (n — r — 1, n — r + l),...,(l,n — r + 1), 

(n — r, n — 1), (n — r — 1, n — 1), . . . , (1, n — 1). 
Since the number of super-algebras of type (v,u) is C™C", we can reach 

n — 1 n—r 



g(n,r)= £ ^Q-C" 



u—n—r v—1 

algebras. Changing the first summation index to k = n — u, we get 



r 7i — r 



g(n,r)^J2J2 C n-k- C "~ k - 



fc=i «=i 



Note that C" ,C"- fe = C?,C k + v , because C" i ,C"- fc = , ,., • ,, (n ~ fc)! ,, = 

n\ „ nr | (~in f-ik+v _ n) ( fc +") ! _ n\ Nnw w p pan 

k\v\(n-k-vy.' anCl °fc+t>°fc - (fc+iOKn-fc-t,)! TiT ~ {n-k-v)\k\v\ ' iN0W ' We Can 
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write g{n, r) = J2k=i S"=i ^k+v ' Cfc + "' an< ^ changing the second summation 
index to h = k + v, we have 

r fc+rt— r 

<?(n,r) = £ E ^ ' C t 
fe=i h=fe+i 

We notice that g(n,r) — f(n,r) — 1. From the super-algebra (V)y, where V = 
{0, 1, . . . , n — r — 1}, we reach the empty quotient by input c, since V contains 
0, but not n — 1. 

Since we can reach /(n, r) super-algebras, no other collection can be reached, 
and the proposition holds. □ 

The entire process of finding the complexity of atoms is illustrated in the 
example below for n = 3. 

Example 2. Let L 3 be the language accepted by the quotient DFA T> 3 of Defi- 
nition 3 and Table 2, where the initial state is identified by an incoming arrow 
and the final state, by an outgoing arrow. The first column consists of states q, 
and the remaining columns give the values of 8(q, x) for each x G E. Let the 
quotients of L3 be Kq = L3 = e^ 1 L^, K\ = a _1 L3, and K2 = (aa)~ l L?,. The 
states of 2?3 are subscripts of quotient symbols. 

Reversing X> 3 , we obtain the NFA Vf of Table 3. The states of Vf arc the 
same as those of 2? 3 , but the transitions are to sets of states, and 02 stands for 
{0,2}, stands for {0}, etc. 



Table 2. Quotient DFA V 3 of L 3 . 



Table 3. NFA D? for L?. 







a 


b 


c 




— > 





1 


1 









1 


2 





1 






2 





2 





-> 







a 


b 


c 









2 


1 


02 


— ¥ 




1 








1 




-> 


2 


1 


2 








Next, we perform the subset construction on ©3 to determinize it and get 
the DFA P™, the quotient DFA for L§. Since Vf is trim, and (Vf) R = V 3 is 
deterministic, the DFA T>f° shown in Table 4 is minimal by Theorem 2. 

The states of T>f° are sets of (subscripts of) quotient symbols. Now we reverse 
Vf° to get Vf° M of Table 5, which is isomorphic to the atomaton .A3. The states 



of 2?* DK are still sets of (subscripts of) quotient symbols. Note that the empty 
set of quotient symbols is a state of Vf v , and hence also of A 3 . It is not to be 
confused with the empty set of transitions associated with states 0, 2, 01, and 
12 under input c indicated by — . 
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Table 4. DFA Vf° for L§. 



Table 5. Atomaton A 3 = T>f l 







a 


b 


c 

























2 


1 


02 






1 








1 




—¥ 


2 


1 


2 









01 


02 


01 


012 


— > 




02 


12 


12 


02 


— > 




12 


01 


02 


1 






012 


012 


012 


012 


-> 







a 


b 


c 















0,2 




— »■ 





1 


1 








1 


2 





1,12 






2 





2 




— > 


— >• 


01 


12 


01 






— ¥ 


02 


01 


12 


0,02 






12 


02 


02 








012 


012 


012 


01,012 





Table 6. DFA £> i2 of ,4oi2- 







a 


b 


c 




— ¥ 


012 


012 


012 


01,012 






01,012 


12,012 


01,012 


01,012 






02,012 


01,012 


12,012 


0,01,02,012 






12,012 


02,012 


02,012 


01,012 






0,01,02,012 


1,01,12,012 


1,01,12,012 


0,01,02,012 






1,01, 12,012 


2,02,12,012 


0,01,02,012 


1,01,12,012 






2,02, 12,012 


0,01,02,012 


2,02,12,012 


0,01,02,012 


—¥ 



Atom A012 = Kq n K\ n K 2 is the language accepted by started in state 
012. The states of X>oi2, the minimal DFA of A012, are collections of sets of 
quotients. As seen from Table 6, the quotient complexity of Aq\2 is seven. 

Atom Aqi = KonKinK2 is accepted by ^3 started in state 01. The minimal 
DFA T>oi of Aqi is shown in Table 7, and the quotient complexity of Aqi is ten. 
Since 01, 12 and 02 are strongly connected by a, the same collections are reached 
from these states, and so the quotient complexity of A 12 and A 02 is also ten. 

Atom A 2 = K (~1 K 1 n K 2 is accepted by A3 started in state 2. The minimal 
DFA T>2 of A 2 is shown in Table 8, and the quotient complexity of A 2 is ten. 
Since 0, 1 and 2 are strongly connected by a, the same collections are reached 
from these states, and so the quotient complexity of A and Ai is also ten. 

Finally, atom A$ = K fl K\ n K 2 is accepted by ^3 started in state 0. The 
minimal DFA V% is shown in Table 9, and the quotient complexity of A% is 
seven. Note that P012 and T>% have isomorphic transition tables, if we ignore 
final states. The isomorphism is ip : 2 2<3 — ¥ 2 2<3 defined as follows: If C C 2^ is 
a collection of subsets of Q, then ip{C) = {Q\S \ S £ C}. 
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Table 7. DFA V i of A 01 . 



Table 8. DFA T> 2 of A 2 . 







a 


b 


c 




















— ¥ 


01 


12 


01 









02 


01 


12 


0,02 






12 


02 


02 









0,01 


1,12 


1,01 









0,02 


1,01 


1, 12 


0,02 






1,01 


2, 12 


0,01 


1, 12 






1,12 


2,02 


0,02 


1,12 






2,02 


0,01 


2, 12 


0,02 






2, 12 


0,02 


2,02 





-> 



Table 9 







a 


b 


c 

























1 


1 









1 


2 





1,12 




— ¥ 


2 





2 





— ► 




0,01 


1,12 


1,01 









0,02 


1,01 


1, 12 


0,02 






1,01 


2, 12 


0,01 


1, 12 






1,12 


2,02 


0,02 


1,12 






2,02 


0,01 


2, 12 


0,02 


-¥ 




2, 12 


0,02 


2,02 





-> 



. DFA £> of ^1 . 







a 


b 


c 




— > 











0,2 






0,0 


0,1 


0,1 


0,2 






0,1 


0,2 


0,0 


0,1,2,12 






0,2 


0,0 


0,2 


0,2 


— ¥ 




0,0,1,01 


0,1,2,12 


0,0,1,01 


0,1,2,12 






0,0,2,02 


0,0,1,01 


0,1,2,12 


0,0,2,02 


—¥ 




0,1,2,12 


0,0,2,02 


0,0,2,02 


0,1,2,12 


-¥ 



6 Conclusions 

The atoms of a regular language L are its basic building blocks. We have studied 
the quotient complexity of the atoms of L as a function of the quotient complexity 
of L. We have computed an upper bound for the quotient complexity of any atom 
with r complemented quotients, and exhibited a class {L n } of languages whose 
atoms meet this bound. 

Acknowledgments We are grateful to Baiyu Li for writing a program for evalu- 
ating the quotient complexity of atoms. We thank Eric Rowland and Jeff Shallit 
for computing the ratio defined for Table 1 for some large values of n. 



References 

1. Brzozowski, J.: Canonical regular expressions and minimal state graphs for definite 
events. In: Proceedings of the Symposium on Mathematical Theory of Automata. 
Volume 12 of MRI Symposia Series, Polytechnic Press, Polytechnic Institute of 
Brooklyn, N.Y. (1963) 529-561 

2. Brzozowski, J.: Quotient complexity of regular languages. J. Autom. Lang. Comb. 
15(1/2) (2010) 71-89 



16 



3. Brzozowski, J., Tamm, H.: Theory of atomata. In Mauri, G., Leporati, A., eds.: Pro- 
ceedings of the 15th Int. Conference on Developments in Language Theory (DLT). 
Volume 6795 of Lecture Notes in Computer Science, Springer (2011) 105-117 

4. Denes, J.: On transformations, transformation semigroups and graphs. In Erdos, 
P., Katona, C, eds.: Theory of Graphs. Proceedings of the Colloquium on Graph 
Theory held at Tihany, 1966, Akademiai Kiado (1968) 65-75 

5. Nerode, A.: Linear automaton transformations. Proc. Amer. Math. Soc. 9 (1958) 
541-544 

6. Piccard, S.: Sur les fonctions definies dans les ensembles finis quelconques. Fund. 
Math. 24 (1935) 298-301 

7. Salomaa, A., Wood, D., Yu, S.: On the state complexity of reversals of regular 
languages. Theoret. Comput. Sci. 320 (2004) 315-329 

8. Yu, S.: State complexity of regular languages. J. Autom. Lang. Comb. 6 (2001) 
221-234 



17 



